|
|
Accession Number |
TCMCG021C18385 |
gbkey |
CDS |
Protein Id |
XP_010929630.1 |
Location |
complement(join(3431601..3432272,3432359..3432826,3434070..3435544,3437214..3437613,3437796..3438356,3439999..3440110,3440990..3441006)) |
Gene |
LOC105051054 |
GeneID |
105051054 |
Organism |
Elaeis guineensis |
|
|
Length |
1234aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA268357 |
db_source |
XM_010931328.3
|
Definition |
DNA excision repair protein CSB isoform X1 [Elaeis guineensis] |
CDS: ATGATCTTTCGGTCTCGCCCATGTGCATGTGACAATATGGATGAGGAGGATGAGGACAAACTTTTGTTGAGTAGTTTGGGCATCACATCTGTGAAGCCAGAAGATATTGAACGTAAAATATTATCAGAGGCAAAAAGTGATGCTAAGTGTGGAAGTCAATCAGAGGTCTGCTCTGAGGAGCATGAGCTTGATGGAGAGCCTGAAACTGGTCCATCATCTACCAGCCGAATTAAACTTTATGATAAATTACGGGCAGTGGAAGTTGAAATTGATGCTGTAGCTTCTAGCATTGAAGCGGCTAAAGATGTTGCGTACAGTGAGAATGACCATACAGGTAATGCAGATATTAAAGAGGATAATGATAGGAGAAATGATGATGGTTCTGCTCAAGTTACATCCAATGGATTAACTCTGCAGCAGGCACTAGCAACTGACCGTTTAAGAAGCCTCAAGAAAACAAAGGCTCAACTTCAAAATGAAATCTCTAAGCTTGATGAGAATGCCACTCCTGAGGACTTTGGACATGAGAAGCTACTTGCTGACCTGGTTGAAGAGAAGTGCAAGAGGAAATCAAAAGCAGTTGAACAGTCTAATAGGGATTCAAAAAGTCATTTGAAAACTGTTGCTTACAATGAAGATGCTGATTTTGATGCAGTGCTGGATGCAGCCTCAACTGGATTTGTTGAAACTGAAAGGGATGAATTGATCCGTAAAGGAATATTAACTCCATTTCATAAGATTAAAGGCTTTGAGCGTCGTGTTCAACAACCTGCACCATCAAATAGGCATGTGCCTGAGGAAAGTGCAGCTGAAGATCATGCTTCAGCTAGCATAGCTAAAGTTGCTCAATTAATATCAGATGCGGCACAAAATCGCCCAGCAACCAAATTGCTTGATACTGTGGCCTTATCTGGACTTGATGCACCAACCCATCCTTTTCAAAGACTTAAGGCACCCTTGAAACATCCAGTCTCTCCAAAAGGAAAGGAGTTAGAGAAGAAGACACGAAAGCTGAGAAGGACAAAGAGACCTTTACCTAGCAAGAAATGGAGAAAGGTAGATTCAAAGGAGAAACTGCCTGATGGAAGTGATGAAGATTCAATGGGAGACTCAATTGCCTCAGATTATGGGGAGACTCAAGAAGAGAACACAGATGATGGGGAGCAATCTCCTGTAATCCTTGAAGGAGGATTAAAAATTCCTGCTTCTATTTATATGAACCTTTTTGATTATCAAAAAGTGGGGATGAAGTGGCTGTGGGAGCTGCACTGCCAAAGAGCTGGTGGCATTATAGGAGATGAAATGGGTCTGGGCAAGACTGTGCAGGTGATATCGTTTCTTGGTGCTTTACATTTCAGTAAAATGTATAAACCCAGCATTGTTGTTTGTCCAGTTACCCTTCTGCGACAGTGGCAGCGGGAAGCTCGAAAATGGTACCCTGACTTCAGAGTTGAGATATTGCATGATTCTGCACATGGTTTAAATAAACAGACGGTGGCAAAGTCAAGTGAAAGCGATTATGATAGCGAAGATTCCCTGGATTCTGATAATGAAAGACCTCGCCCTGCCAAGTCGGTAAAGAGGTGGAATGATTTGATTGATCGTGTTGTGCAATCAGAATCTGGGTTACTTCTCACCACATATGAGCAACTACGCATTCTAGGGGAGAAGTTGCTCGATATAGAGTGGGGATATGCCATATTGGATGAGGGTCACCGTATAAGAAATCCTAATGCTGAAGTAACTTTAGTTTGTAAACAGTTACAGACAGTTCACCGTATAATCATGACTGGTGCACCAATTCAGAACAAGCTTTCAGAACTTTGGTCCCTTTTCGATTTTGTCTTCCCTGGAAAGCTAGGTGTTTTACCTGTATTTGAGACAGAATTTGCTGTTCCTATTACAGTTGGCGGGTATGCTAATGCTACACCATTGCAAGTGTCCACAGCTTACAGATGTGCAGTTGTCTTGCGTGACTTGATAACGCCATACCTTCTTAGGCGTATGAAAGCTGATGTGAACGCCCAACTTCCCAAGAAAACTGAGCATGTCCTTTTCTGTAGCCTAACTTCAGACCAGCGATCTGTTTATAGGGCATTCCTTGCTAGTTCTGAAGTGGAGCAAATTTTTGAGGGCAGTAGAAACTCACTTTATGGAATAGATATCATGCGTAAGATTTGCAATCATCCTGATCTTTTAGAAAGAGAACACTCTGCTCTGCATCCAGACTATGGGAATCCAGAGCGGAGTGGAAAAATGAAAGTGGTTGCTCAAGTACTTAGGGTCTGGAAGGAGCAAGAACATCGTGTTCTACTCTTCGCACAGACTCAACAAATGCTAGACATTTTGGAAAACTTCCTGGCTGCAAGTGGGTATAGCTATCGGAGAATGGATGGACTTACTCCTATAAAGCAGAGGATGGCACTTATAGATGAATTTAACAACTCATCTGATGTCTTTATTTTTATCCTGACTACTAAAGTTGGGGGTTTAGGGACAAACTTAACTGGTGCTGACAGGGTTATCATATATGATCCTGACTGGAATCCTTCCACAGATATGCAGGCAAGAGAACGAGCTTGGCGGATCGGGCAGAAACGGGATGTAACGGTTTACAGGTTGATCACGCGTGGAACTATAGAGGAGAAGGTGTATCATAGGCAGATTTACAAACATTTCCTGACCAATAAGATATTGAAGAACCCTCAGCAAAGAAGATTCTTTAAAGCCAAAGACATGAAGGATCTCTTCACGCTGCAAGATGATAGAGAGGGTGGTTCTACTGAGACTTCAAATATATTCAGCCAGTTGTCTGAAGAGGTAAATGTTGGGGTTGGCAATGGCTACCAAGATAAACAAGGGTCCTCTGCAGCTTCCACTGCTCCTGTTGTGCCAGCTAAAGAAACAAACTCTCCAGGGCTTGGAGCATCTAGCTCCAACAGCAAAGGAAAAGAGATTGCCGGTCAGAGGAATGGTGAAATAGATGAAGAAACAAATATATTGAAGAGTCTTTTTGATGCTCATGGGATTCATAGTGCAATGAACCATGATGCTATTCTGAATGCCAATGATGACGATAAGATGAGGCTGGAGGAGCAAGCTTCCCGAGTTGCACGAAGGGCAGCTGAAGCCTTGCGTGAATCAAGAAGGCTCAGAAGCCGTGACAGTTTTTCTGTTCCAACATGGACCGGAAGATCAGGTGCTGCTGGGGCACCATCATCCATCCGTAGGAAGTTTGGGTCTACCATAAATACTCAAATGCTTGGTCCTTCAAAACCATCAGAAGGGTCTGCAAGTAGGCCTCCTGGTTTAGCAGCTGGAGCTTCCACTGGTAAGGCACTGTCTTCAGCTGAACTCTTGGCTAGAATCCGTGGAACTCAAGAAAGAGCAGTTGGCGATGCACTTGAGCAAGATCTAGATTTGGCATCTAGTTCGAATCAGAGAGAGAGCATTCCTGAAAACACTGTGGCATCAAAGCCTTCCCATAGGTATATGGTGGTCCAACCTGAGATCTTGATCCGCCAACTATGCACCTTCATACAGCAGAGGGGTGGGCAAACTGATTCTGCTAGCATAACACAGCACTTCAAGGACAGGATACAGTCAAAAGATTTGCCTCTGTTTAAGAATCTTCTCAAGGAGATAGCTGCACTAGAGAAAGATGCTGGTGGATCTAGATGGGTTTTAAAGCCAGAATATCAGTAG |
Protein: MIFRSRPCACDNMDEEDEDKLLLSSLGITSVKPEDIERKILSEAKSDAKCGSQSEVCSEEHELDGEPETGPSSTSRIKLYDKLRAVEVEIDAVASSIEAAKDVAYSENDHTGNADIKEDNDRRNDDGSAQVTSNGLTLQQALATDRLRSLKKTKAQLQNEISKLDENATPEDFGHEKLLADLVEEKCKRKSKAVEQSNRDSKSHLKTVAYNEDADFDAVLDAASTGFVETERDELIRKGILTPFHKIKGFERRVQQPAPSNRHVPEESAAEDHASASIAKVAQLISDAAQNRPATKLLDTVALSGLDAPTHPFQRLKAPLKHPVSPKGKELEKKTRKLRRTKRPLPSKKWRKVDSKEKLPDGSDEDSMGDSIASDYGETQEENTDDGEQSPVILEGGLKIPASIYMNLFDYQKVGMKWLWELHCQRAGGIIGDEMGLGKTVQVISFLGALHFSKMYKPSIVVCPVTLLRQWQREARKWYPDFRVEILHDSAHGLNKQTVAKSSESDYDSEDSLDSDNERPRPAKSVKRWNDLIDRVVQSESGLLLTTYEQLRILGEKLLDIEWGYAILDEGHRIRNPNAEVTLVCKQLQTVHRIIMTGAPIQNKLSELWSLFDFVFPGKLGVLPVFETEFAVPITVGGYANATPLQVSTAYRCAVVLRDLITPYLLRRMKADVNAQLPKKTEHVLFCSLTSDQRSVYRAFLASSEVEQIFEGSRNSLYGIDIMRKICNHPDLLEREHSALHPDYGNPERSGKMKVVAQVLRVWKEQEHRVLLFAQTQQMLDILENFLAASGYSYRRMDGLTPIKQRMALIDEFNNSSDVFIFILTTKVGGLGTNLTGADRVIIYDPDWNPSTDMQARERAWRIGQKRDVTVYRLITRGTIEEKVYHRQIYKHFLTNKILKNPQQRRFFKAKDMKDLFTLQDDREGGSTETSNIFSQLSEEVNVGVGNGYQDKQGSSAASTAPVVPAKETNSPGLGASSSNSKGKEIAGQRNGEIDEETNILKSLFDAHGIHSAMNHDAILNANDDDKMRLEEQASRVARRAAEALRESRRLRSRDSFSVPTWTGRSGAAGAPSSIRRKFGSTINTQMLGPSKPSEGSASRPPGLAAGASTGKALSSAELLARIRGTQERAVGDALEQDLDLASSSNQRESIPENTVASKPSHRYMVVQPEILIRQLCTFIQQRGGQTDSASITQHFKDRIQSKDLPLFKNLLKEIAALEKDAGGSRWVLKPEYQ |